Similarity Learning in Nearest Neighbor and Application to Information Retrieval

Author

Ali Mustafa Qamar

Abstract

Many researchers have tried to learn a Mahalanobis distance metric for kNN classification by considering the geometry of the space containing the examples. However, similarity may have an edge, especially when dealing with text, e.g. in Information Retrieval. We propose an online algorithm, SiLA (Similarity Learning Algorithm), whose aim is to learn a similarity metric (e.g. the cosine measure or the Dice and Jaccard coefficients), and its variant eSiLA, in which the learned matrix is projected onto the cone of positive semidefinite matrices. Two incremental algorithms have been developed: one based on the standard kNN rule, the other its symmetric version. SiLA can be used in Information Retrieval, where performance can be improved by using user feedback.
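The similarity measures named in the abstract can be illustrated with a minimal Python sketch. It is not the SiLA algorithm itself: the function names are hypothetical, and `bilinear_similarity` is only an assumed illustration of how a matrix `A` could parameterize a cosine-like similarity (the abstract does not give the exact form used in the paper).

```python
import math


def cosine(x, y):
    """Cosine measure between two equal-length numeric vectors."""
    dot = sum(a * b for a, b in zip(x, y))
    norm = math.sqrt(sum(a * a for a in x)) * math.sqrt(sum(b * b for b in y))
    return dot / norm if norm else 0.0


def dice(a, b):
    """Dice coefficient between two sets: 2|A ∩ B| / (|A| + |B|)."""
    total = len(a) + len(b)
    return 2 * len(a & b) / total if total else 0.0


def jaccard(a, b):
    """Jaccard coefficient between two sets: |A ∩ B| / |A ∪ B|."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


def bilinear_similarity(x, y, A):
    """Assumed illustration: x^T A y normalized by the vector norms.

    With A equal to the identity matrix this reduces to the cosine measure.
    This form is an assumption made for the example, not the paper's definition.
    """
    ay = [sum(a_ij * y_j for a_ij, y_j in zip(row, y)) for row in A]
    num = sum(x_i * v for x_i, v in zip(x, ay))
    norm = math.sqrt(sum(v * v for v in x)) * math.sqrt(sum(v * v for v in y))
    return num / norm if norm else 0.0


# Two toy documents represented as sets of terms (made-up data).
doc1 = {"nearest", "neighbor", "similarity"}
doc2 = {"nearest", "neighbor", "retrieval", "text"}

dice_score = dice(doc1, doc2)        # 2 * 2 / (3 + 4)
jaccard_score = jaccard(doc1, doc2)  # 2 / 5
```

The set-based measures suit binary term representations, while the vector-based ones suit weighted term vectors; a learning algorithm would adjust `A` rather than leave it fixed.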

Similar resources

Improved Nearest Neighbor Methods For Text Classification

We present new nearest neighbor methods for text classification and an evaluation of these methods against the existing nearest neighbor methods as well as other well-known text classification algorithms. Inspired by the language modeling approach to information retrieval, we show improvements in k-nearest neighbor (kNN) classification by replacing the classical cosine similarity with a KL dive...

Optimizing Nearest Neighbor Retrieval by Similarity Template and Retrieval Query Generation

The nearest neighbor algorithm is the most basic class of techniques in the subfields of machine learning such as case-based reasoning (CBR), memory-based reasoning (MBR), and instance-based learning (IBL). In the nearest neighbor algorithm, the computational cost of example retrieval is one of the most important issues. This paper proposes a novel technique for optimizing the nearest neighbor al...

Improved Nearest Neighbor Methods For Text Classification With Language Modeling and Harmonic Functions

Significance-Sensitive Nearest-Neighbor Search for Efficient Similarity Retrieval of Multimedia Information

Nearest-neighbor search (NN-search) in the feature space is widely used for the similarity retrieval of multimedia information. Each piece of multimedia information is mapped to a vector in a multi-dimensional space where the distance between two vectors (typically, Euclidean distance between the heads of vectors) corresponds to the similarity of multimedia information. Once the feature space i...

FUZZY K-NEAREST NEIGHBOR METHOD TO CLASSIFY DATA IN A CLOSED AREA

Clustering of objects is an important area of research and application in a variety of fields. In this paper we present a good technique for data clustering and an application of this technique to data clustering in a closed area. We compare this method with K-nearest neighbor and K-means.

Publication date: 2009
